-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. Weβll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(bench): evaluate review writing in ReviewBench #917
Conversation
Codecov ReportAttention: Patch coverage is
@@ Coverage Diff @@
## main #917 +/- ##
==========================================
- Coverage 70.13% 63.87% -6.26%
==========================================
Files 80 82 +2
Lines 3569 3900 +331
==========================================
- Hits 2503 2491 -12
- Misses 1066 1409 +343
|
@timsanders256 is this PR ready to merge now? |
) | ||
self.rebuttals.append(rebuttal) | ||
yield rebuttal, self.leader | ||
# self.rebuttals: List[Rebuttal] = [] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
delete this directly instead of commenting.
print(f'Token input count: {token_input_count}') | ||
print(f'Token output count: {token_output_count}') | ||
|
||
# # save all inputs and outputs |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
delete these comments.
token_output_count += token_counter(model=model_name, text=strength) | ||
token_output_count += token_counter(model=model_name, text=weakness) | ||
|
||
# # save all inputs and outputs |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
delete these comments.
generated_weakness: str, | ||
) -> Dict[str, List[float]]: | ||
metrics_raw: Dict[str, Any] = { | ||
# 'bleu': [], |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
delete these comments
Closes #
π Description
β Checks
type/descript
(e.g.feature/add-llm-agents
)βΉ Additional Information